AMultivariate Approach to Integrating Datasets using made4 and ade4
نویسندگان
چکیده
The public microarray repositories, ArrayExpress and the GeneExpression Omnibus (GEO), now contain over 100,000microarray gene expression profiles (Table 1). This is a considerable data resource. However the average number of arrays per study is only between 30 and 40 (Table 1). Given that the number of features (genes) on microarrays now exceeds 50,000, this presents a considerable dimensionality problem. Low case to feature ratio is likely to remain an issue, as cost and availability of biomaterial, such as biopsy tissue, are often limiting. As a result, meta-analysis or merging data from multiple studies is attractive.
منابع مشابه
Multiple Co-inertia Analysis of Multiple OMICS Data using omicade4
Multivariate approaches have been applied successfully in the analysis of high throughput ”omics”data. Principal component analysis (PCA) has been shown to be useful in exploratory analysis of linear trends in biological data [1]. Culhane and colleagues employed a two table coupling method (co-inertia analysis, CIA) to examine covariant gene expression patterns between microarray datasets from ...
متن کاملMADE4: an R package for multivariate analysis of gene expression data
SUMMARY MADE4, microarray ade4, is a software package that facilitates multivariate analysis of microarray gene-expression data. MADE4 accepts a wide variety of gene-expression data formats. MADE4 takes advantage of the extensive multivariate statistical and graphical functions in the R package ade4, extending these for application to microarray data. In addition, MADE4 provides new graphical a...
متن کاملInteractive Multivariate Data Analysis in R with the ade4 and ade4TkGUI Packages
ade4 is a multivariate data analysis package for the R statistical environment, and ade4TkGUI is a Tcl/Tk graphical user interface for the most essential methods of ade4. Both packages are available on CRAN. An overview of ade4TkGUI is presented, and the pros and cons of this approach are discussed. We conclude that command line interfaces (CLI) and graphical user interfaces (GUI) are complemen...
متن کاملAdenovirus vector E4 gene regulates connexin 40 and 43 expression in endothelial cells via PKA and PI3K signal pathways.
Connexins (Cxs) provide a means for intercellular communication and play important roles in the pathophysiology of vascular cardiac diseases. Infection of endothelial cells (ECs) with first-generation E1/E3-deleted E4+ adenovirus (AdE4+) selectively modulates the survival and angiogenic potential of ECs by as of yet unrecognized mechanisms. We show here that AdE4+ vectors potentiate Cx expressi...
متن کاملA Novel Approach to Feature Selection Using PageRank algorithm for Web Page Classification
In this paper, a novel filter-based approach is proposed using the PageRank algorithm to select the optimal subset of features as well as to compute their weights for web page classification. To evaluate the proposed approach multiple experiments are performed using accuracy score as the main criterion on four different datasets, namely WebKB, Reuters-R8, Reuters-R52, and 20NewsGroups. By analy...
متن کامل